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Abstract. We investigate the relationship between the sizes of the sum and differ- 
ence sets attached to a subset of {0, 1, A^}, chosen randomly according to a binomial 
model with parameter p(A^), with N^^ — o{p{N)). We show that the random subset 
is almost surely difference dominated, as A^ — + oo, for any choice of p{N) tending 
to zero, thus confirming a conjecture of Martin and O'Bryant. The proofs use recent 
strong concentration results. 

Furthermore, we exhibit a threshold phenomenon regarding the ratio of the size of 
the difference- to the sumset. If p{N) — o{N^^/^) then almost all sums and differ- 
ences in the random subset are almost surely distinct, and in particular the difference 
set is almost surely about twice as large as the sumset. If N^-^^^ = o{p{N)) then both 
the sum and difference sets almost surely have size (2A^ + 1) — 0{p{N)~'^), and so 
the ratio in question is almost surely very close to one. If p{N) — c • N^^^^ then as c 
increases from zero to infinity (i.e., as the threshold is crossed), the same ratio almost 
surely decreases continuously from two to one according to an explicitly given function 
of c. 

We also extend our results to the comparison of the generalized difference sets at- 
tached to an arbitrary pair of binary linear forms. For certain pairs of forms / and g, 
we show that there in fact exists a sharp threshold at c/ g • N^^^^, for some computable 
constant c/.g, such that one form almost surely dominates below the threshold, and the 
other almost surely above it. 

The heart of our approach involves using different tools to obtain strong concentra- 
tion of the sizes of the sum and difference sets about their mean values, for various 
ranges of the parameter p. 



1. Introduction 

To know whether a random variable is strongly concentrated is an issue of fundamen- 
tal importance in many areas of mathematics and statistics. In this paper we apply recent 
results of Kim and Vu HKiVui |Vul[ |Vu2|| to completely solve a combinatorial number 
theory question on the size of difference- and sumsets of integers. A classical strong 
concentration result (due to Chemoff) states that ifY = J2^=i ^« ^^^^ '^he U i.i.d. binary 
random variables, then for any A > we have Prob(|y — IE[V]| > VXn) < 2e~^/^. 
Within number theory, this result was used by Erdos (see HASH . Chapter 8) to prove the 
existence of so-called 'thin' bases of N of order 2. The general requirement for many 
applications is to obtain Chemoff-like exponential deviation bounds in situations when 
the atom variables U are not independent. For modern surveys of strong concentration 
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inequalities see, for example, [Ta] and f Vulll ; the latter, in particular, contains a fine se- 
lection of applications in random graph theory, combinatorial number theory and finite 
geometry. 

The specific result we shall utilise is a martingale inequality which appears as Lemma 
3.1 in [|Vu2ll . It is an extension of the classical Azuma inequality ( flASl . Chapter 7) to 
functions whose Lipschitz coefficients are small 'on average'. As remarked in [|Vu2L 
this type of inequality is very general and robust, and is expected to be applicable in 
numerous situations; this is definitely true for our problem. 

Let S* be a subset of the integers. We define the sumset S + S and difference set S — S 

by 

S + S = {si + S2 ■■ Si e S} 

S-S = {s^-S2:s,eS}, (1.1) 

and denote the cardinality of a set A by \A\. As addition is commutative and subtrac- 
tion is not, a typical pair of integers generates two differences but only one sum. It 
is therefore reasonable to expect a generic finite set S will have a larger difference set 
than sumset. We say a set is sum dominated (such sets are also called more sums than 
differences, or MSTD, sets) if the cardinality of its sumset exceeds that of its difference 
set. If the two cardinalities are equal we say the set is balanced, otherwise difference 
dominated. Sum dominated sets exist: consider for example {0, 2, 3, 4, 7, 11, 12, 14} 
(see IHe, MS, Na2] for additional examples). In UNall . Nathanson wrote "Even though 
there exist sets A that have more sums than differences, such sets should be rare, and 
it must be true with the right way of counting that the vast majority of sets satisfies 
\A-A\> \A + A\." 

Recently Martin and O'Bryant HMOU showed there are many sum dominated sets. 
Specifically, let 1^ = {0, . . . , A^}. They prove the existence of a universal constant 
KsD > such that, for any > 14, at least k^d ■ 2^+^ subsets of 1^ are sum dominated 
(there are no sum dominated sets in /13). Their proof is based on choosing a subset of 
In by picking each n E In independently with probability 1/2. The argument can be 
generalized to independently picking each n E In with any probability p E (0, 1), and 
yields the existence of a constant ksb,p > such that, as A^ — > 00, a randomly chosen 
(with 

respect to this model) subset is sum dominated with probability at least /tsD,p- Sim- 
ilarly one can prove there are positive constants kdd.p and kb,p for the probability of 
having a difference dominated or balanced set. 

While the authors remark that, perhaps contrary to intuition, sum dominated sets are 
ubiquitous, their result is a consequence of how they choose a probability distribution 
on the space of subsets of In- Suppose p = 1/2, as in their paper. With high probability 
a randomly chosen subset will have A^/2 elements (with errors of size a/A). Thus the 
density of a generic subset to the underlying set In is quite high, typically about 1/2. 
Because it is so high, when we look at the sumset (resp., difference set) of a typical A 
there are many ways of expressing elements as a sum (resp., difference) of two elements 
of A. For example (see [MQ|). if k E A+Athenthereareroughly A^/4— |A^— A;|/4 ways 
of writing /c as a sum of two elements in A (similarly, if k E A — A there are roughly 
A^/4 — I A; 1/4 ways of writing as a difference of two elements of A). This enormous 
redundancy means almost all numbers which can be in the sumset or difference set are. 
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In fact, using uniform density on the subsets of 1^ (i.e., taking p = 1/2), Martin and 
O'Bryant show that the average value of |y4 + A| is 2iV - 9 and that of |y4 - A] is 2iV - 5 
(note each set has at most 2N + 1 elements). In particular, it is only for k near extremes 
that we have high probability of not having k in an A + A or an A — A. In HMOU they 
prove a positive percentage of subsets of In (with respect to the uniform density) are 
sum dominated sets by specifying the fringe elements of A. Similar conclusions apply 
for any value ofp> 0. 

At the end of their paper, Martin and O'Bryant conjecture that if, on the other hand, 
the parameter p is a function of tending to zero arbitrarily slowly, then as — > oo 
the probability that a randomly chosen subset of 1^ is sum dominated should also tend 
to zero. In this paper we will, among other things, prove this conjecture. 

We shall find it convenient to adopt the following (fairly standard) shorthand nota- 
tions. Let X be a real- valued random variable depending on some positive integer pa- 
rameter A^, and let f{N) be some real- valued function. We write 'X ~ f{Ny to denote 
the fact that, for any ei, 62 > 0, there exists A^^i^ej > such that, for all N > N^-i^e2^ 

P (X ^ [(1 - e^)f{N), (1 + ei)/(X)]) < e,. (1.2) 

In particular we shall use this notation when X is just a function of N (hence not 
'random'). In practice X will in this case be the expectation of some other random 
variable. 

By f{x) = 0{g{x)) we mean that there exist constants xq and C such that for all 

X > xq, \ f{x)\ < Cg{x). By f{x) = <d{g{x)) we mean that both f{x) = 0{g{x)) 
and g{x) = 0{f{x)) hold. Finally, ii \im.x^oo f {x) / g{x) = then we write f(x) = 
o{g{x)). 

Our main findings can be summed up in the following theorem. 

Theorem 1.1. Let p : N ^ (0, 1) be any function such that 

AT-i = o{p{N)) and p{N) = o(l). (1.3) 

For each N G N let A be a random subset of 1^ chosen according to a binomial 
distribution with parameter p{N). Then, as N ^ 00, the probability that A is difference 
dominated tends to one. 

More precisely, let 3l denote respectively the random variables \A ^ A\ and 
\A — A\. Then the following three situations arise : 

(i) p{N) = o{N-^/^) : Then 

y ~ (^-P^^))' and ^~2^ ~ {N-p{N))\ (1.4) 

(ii) p{N) = c ■ N'^/"^ for some c G (0, 00) .• Define the function g : (0, 00) (0, 2) by 

:= 2 ( '''-^' - (1.5) 



X 



Then 



- g{^]N and ^ ~ g{c')N. (1.6) 
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(in) Ar-V2 ^ o{p{N)) : Let 5^^ := (2A^ + 1) - ^ := (2A^ + 1) - ^. Then 

Remark 1.2. Obviously, not all functions p : N ^ (0, 1) satisfying (1.3) conform to the 
requirements of (i), (ii) or (iii) above, but these are the natural functions to investigate 
in the current context. Similar remarks apply to Theorem 3.1 and Conjecture 4.2 below. 

Theorem I 1 . 1 [ proves the conjecture in [MO] and re-establishes the validity of Nathanson's 
claim in a broad setting. It also identifies the function N^^^"^ as a threshold function, 
in the sense of nJLRL for the ratio of the size of the difference- to the sumset for a 
random set A C J^. Below the threshold, this ratio is almost surely 2 + o(l), above it 
almost surely H-o(l). Part (ii) tells us that the ratio decreases continuously (a.s.) as the 
threshold is crossed. Below the threshold, part (i) says that most sets are 'nearly Sidon 
sets', that is, most pairs of elements generate distinct sums and differences. Above the 
threshold, most numbers which can be in the sumset (resp., difference set) usually are, 
and in fact most of these in turn have many different representations as a sum (resp., a 
difference). However the sumset is usually missing about twice as many elements as 
the difference set. Thus if we replace 'sums' (resp., 'differences') by 'missing sums' 
(resp., 'missing differences'), then there is still a symmetry between what happens on 
both sides of the threshold. 

We prove Theorem 11.11 in the next section. Our strategy will consist of first estab- 
lishing an estimate for the expectation of the random variable or Si, followed by es- 
tablishing sufficiently strong concentration of these variables about their mean values. 
For the second part of this strategy we will use different approaches for p = 0{N^^^'^) 
and A^"^/^ = o{p{N)). In the former range a fairly straightforward second moment 
argument works. In the latter range, however, we will employ a specialization of the 
Kim-Vu martingale lemma (Lemma 3.1 in [iVu2ll . Lemma 2.2 below). 

In Section 3, we extend our result to arbitrary binary linear forms. The paper UNOORSI 
provides motivation for studying these objects. By a binary linear form we mean a 
function /(x, y) = ux + vy where u,v E Z^qj w > |f | and GCD('u, v) = 1. For a set A 
of integers we let 

f{A) := {uai + va2 : ai, 02 G A}. (1.8) 

Except in the special case m = f = 1 we always have that f{x, y) ^ f{y, x) whenever 
X ^ y. Thus we refer to / as a difference form and a set f{A) as a generalized differ- 
ence set, whenever u > \v\. Theorem 13 . 1 1 allows us to compare the sizes of f{A) and 
g{A) for random sets A, and arbitrary difference forms / and g when N^^^^ = o{p(N)). 



Two situations arise : 



(a) for some pairs of forms, the same one a.s. dominates the other for all parame- 
ters p = p{N) in this range. In fact every other difference form dominates x — y, and 
hence also x + y. 

(b) for certain pairs / and g, something very nice happens. Namely, there is now a sharp 
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threshold, in the sense of r jLRH . at Cf,gN-^/^, for some computable constant cj^g > 0, 
depending on / and g. One form dominates a.s. below the threshold, and the other one 
a.s. above it. This fact may be considered a partial generalization of the main result 
of UNOORSI to random sets, partial in the sense that is only applies to certain pairs of 
forms. Namely, they proved that for any two forms / and g (including x + y), there 
exist finite sets y4i and such that |/(v4i)| > |5'(v4i)| whereas |/(v42)| < |fi'(v42)|. 

We leave it to future work to investigate what happens as the threshold is crossed in 
this situation. 

In Section 4 we make a brief summary of this and other remaining questions and make 
suggestions for other problems to study. In particular, we suggest looking at other prob- 
abilistic models for choosing random sets. This is partly motivated by the fact that 
our results in Section 3 only apply when N^^^^ = o{p{N)). The reason is that, for 
faster decaying p{N), as we shall see, the variance in the size of the random set A it- 
self swamps all other error terms, and it is meaningless to compare and 1(7(^)1; 
in other words, the model itself becomes useless. This may be considered a problem 
when N-^/^ = o{p{N)). For p{N) = o(A^~^/^), the results of HGJLR I imply that all 
pairs {x,y) in a random set a.s. generate different values f{x,y), for any /, so that 
1/(^)1 = 1^(^)1 a.s. for any / and ^. 

2. Proof of Theorem [UJ 

Our strategy for proving the various assertions in Theorem 1 1.1 1 is the following. Let 
be one of the random variables ^, y^, as appropriate. We then carry out 
the following two steps : 

Step 1 : Prove that behaves asymptotically as asserted in the theorem. 

Step 2 : Prove that ^ is strongly concentrated about its mean. 

As already mentioned, the calculations required to perform these two steps differ ac- 
cording as to whether p(A^) = 0(iV~^/^) or iV~^/^ = o{p{N)). In particular, in the 
former case. Step 2 is achieved by a fairly straightforward second moment argument, 
whereas a more sophisticated concentration inequality is used in the latter case. We 
thus divide the proof of the theorem into two separate cases, depending on the parame- 
ter function p. 

Throughout the paper we often abuse notation to save space, writing p for p{N). As 
we never consider the case where p{N) is constant (as this case has been analyzed in 
HMOID . this should not cause any confusion. 

Casel :p(A^) = 0{N-^/^). 

We first concentrate on the sumset and prove the various assertions in parts (i) and 
(ii) of the theorem. The proofs for the difference set will be similar. For any finite set 
A C No and any integer A; > 1, let 

■■= {{{ai, 02}, • • • , {a2k-i, a2fc}} : ai + a2 = ■ ■ ■ = 02^-1 + a2fe} • (2.1) 
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In words, Ak consists of all unordered fc-tuples of unordered pairs of elements of A 
having the same sum. Let := \Ak\. So if ^4 is a random set, then each X/. is a non- 
negative integer valued random variable. The crucial observation for our work is that, 
in the model we are considering, the random variables X^ are all highly concentrated : 

Lemma 2.1. Forp{N) = 0{N^^^'^) we have for every k that 

(fc + 1 ^ 
, ^, - — — o{p{N)). 

Proof. We write p for p(A^). By the central limit theorem it is clear that 

1^1 ~ Np. (2.3) 

Each Xk can be written as a sum of indicator variables Y^, one for each unordered k- 
tuple a. There are two types of A;-tuples : those consisting of 2k distinct elements of Jjv 
and those in which one element is repeated twice in one of the k pairs, and the sum of 
each of the k pairs is even. The probability of any /c-tuple of the former type occurring 
in Ak is p^'^, whereas for A; -tuples of the latter type this probability is p^^~^. Let there 
be a total of ^i,fe(A^) A; -tuples of the former type and ^2,k{N) of the latter type. Then, by 
linearity of expectation, 

nXk] = i,,k{N) . p"^ + i2,k{N) ■ p''-'. (2.4) 

We have 

n=0 ^ ^ 

where R{n) is the number of representations of n as a sum of two distinct elements of 
In, and hence we easily estimate 

n=2k ^ ^ n=2k ^ 

Thus 



A similar calculation shows that ^2,it(-/V) = Ofc(A^'=), hence ^2,it(-/V)-p^''~^ = Ofe(A^V''~^)- 
Since N"^ — o{p) it follows that 

in accordance with the lemma. To complete the proof of the lemma, we need to show 
that, whenever N~(^) — o(p), the random variable becomes highly concentrated 
about its mean as — > oo. We apply a standard second moment method. In the 
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notation of HASH . Chapter 4, since we already know in this case that E[Xfc] ^ oo, it 
suffices to show that A = o(E[Xfc]2) = Ok{N^''+^p^''), where 

A = J2^{Y^^Yf,), (2.9) 

the sum being over pairs of fc-tuples which have at least one number in common. It is 
easy to see that, since N^'^ = o{p), the main contribution to A comes from pairs {a, P} 
of /c-tuples, each of which consist of 2A; distinct elements of /at, and which have exactly 
one element in common. The number of such pairs is Ok{N'^^^^) since there are 

• Ok{N'^~^^) choices for a, 

• 2k choices for the common element with P, 

• 0{N) choices for the sum of each pair in P, 

• Ok{N^~^) choices for the remaining elements in /3. 

Since a total of Ak—1 elements of In occur in total in aU/?, we have ¥{Ya/\Yi3) = p'^''~^. 
Thus 

A = 0,,fc(Ar2^+V^-^) = o,,fc(Ar2^+V*^), since A^-i = o(p). (2.10) 

This completes the proof of Lemma 2.1. □ 

We can now prove parts (i) and (ii) of the theorem. First suppose p = o{N^^^'^). By 
we have ~ ^N^, whereas X2 ~ j^N^ for N'^/^ = o{p) and EfXa] = 
0(1) otherwise. 

Since p = o{N^^/'^) we have max(l, N^p'^) = o{N'^p^) and thus X2 = o{Xi) almost 
surely. In other words, as ^ cxd, all but a vanishing proportion of pairs of element of 
A will have distinct sums. It follows immediately that 

y ^ X, ^ ^y^, (2.11) 

as claimed. 

Now suppose p = cN^^^"^ for some fixed c > 0. This time we will need to con- 
sider all the Xk together. Let 0^ be the partition on Ai whereby {ai, 02} and {03, 04} 
are in the same part if and only if ai + 02 = 03 + 04. For each i > let ti denote the 
number of parts of size i (as a random variable). Observe that 



=^ = X]^* (2.12) 



=0 



and, for each k >\, that 



5^ ( M r, = X,. (2.13) 

(12.131) is a system of infinitely many equations in the variables x;, which together deter- 
mine y . For any m > 1, the general solution of the subsystem formed by the first m 
equations (i.e.: /c = 1, . . . , m) is readily checked to be 

k=l k=m+l I j=0 ^ ^ J 
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Regarding the second sum on the right of (|2.14l) we have that 



E EM) 

k=m+l L i=0 



< 



E 

k=m+l 



k 

m 



Tk 



Hence it follows that, for any m > 1, 

m 



k=l 



(2.15) 



(2.16) 



Now for p = cN Lemma 2.1 says that 

Xry, ~ 2- 



m + l)\ 



(2.17) 



and since 



(m+l)! 



as m ^ oo, another application of Lemma 2.1 implies that 



k=l 



\ 

So to prove (|1.6I) it just remains to verify that 



E 

k=l 



\ 



(k + l) 



■N. 



(2.18) 



(-1) 



(7(x) = 2-y 

(fc + i) 



(2.19) 



which is an easy exercise. 

This proves parts (i) and (ii) of Theorem 1.1 for the sumset. For the difference set 
one reasons in an entirely parallel manner. One now defines, for each k > 1, 

^'k '■= «2), • • • , (a2fc-i, 0'2k)} : ai - 02 = ■ ■ ■ = 02^-1 - a2fc 7^ 0} . (2.20) 

In words, A'^ consists of all A;-tuples of ordered pairs of elements of A which have the 
same non-zero difference. We let := \Ak\ and in a completely analogous manner to 
Lemma 2.1 prove that 



nx'k] 



2k jvrfc+l 



(fc + 1)!^ 



(2.21) 



and that X^ ~ IE[X^] whenever A^"l^) = o{p). We define the partition of A[ in 
the obvious way and let denote the number of parts of size i, for each i > 1. Since 
^ = 1 + Yl°li follow exactly the same analysis as above to deduce (11.41) and 

(11.61 ). This completes the proofs of parts (i) and (ii) of the theorem. 

Case II : iV- 1/2 _ o{p{N)). 

Recall p{N) = o(l). Set p = p{N) and P = 1/p; thus P = o{N^/'^) (as p = o(l) 
we have limAr^oo P = 00). Again we begin with the sumset. Recall the two steps to be 
accomplished : 



Step 1 : We prove that E[y^] ~ AP^. 
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Step 2 : We prove that the random variable S^^ is strongly concentrated about its mean. 

We begin with the simpler Step 1. For each n G let denote the event that 
A + A. Thus 

2N 

E[^^] = ^P(^„). (2.22) 

n=0 

Observe that P(^n) = ^{<^2N-n)- Since all the ways of representing any given n as a 
sum of two elements of In are independent of one another, we have, for n E In, 

p/^N _ j {I - P^T^\l - p) if n is even 

^^'^"^ " \(l-p2)("+^)/2 ifnisodd. ^^-^^^ 

Since p = o(l) we have 1 — p ~ 1, and since N'^/"^ = o{p) we have (1 — p^)^ = o(l). 
Thus it is easy to see that 

my^] ~ 4- y (1-/)™ — = 4P^^ (2.24) 

m=0 ^ 

as claimed. This completes Step 1. 

For Step 2 we need the martingale machinery of Kim and Vu. 

We use notation consistent with [|Vu2L Consider a fixed A^, which shall tend to infinity 
in our estimates. Let := {0, l}^"*"^. Thus every subset A of In can be identified with 
an element of il. We are working in the probability space fi) where fx is the product 
measure with parameter p. For each A E n E In and x E {0, 1}, define 

Cn{x,A) := |E[^>o,-,an-i,a„ = a;]-E[^>o,-,an-i]|; (2.25) 

by E [S^'^\aQ, a^] we mean the expected value of the random variable given that 
for G {0, . . . , m} the element k is always (resp., never) in the subset if ak = 1 (resp., 
afc = —1). Let 

C{A) := maxCn{x,A). (2.26) 



Further put 



and 



Vn{A) := ['cl{x,A)dy=pCl{0,A) + {l-p)Cl{l,A) (2.27) 
^0 



N 



V{A) := (2.28) 



n=0 



For two arbitrary positive numbers V and C, define the event 

Bv,c := {A : C{A) > C or V{A) > V}. (2.29) 
Then the following is a specialization of a result appearing in [|Vu2|| : 
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Lemma 2.2. For any positive numbers A, V, C such that A < 4V/C^ we have 

P - E[^"] I > < 2e-^/^ + P(Bv,c). (2.30) 

We quickly sketch how Lemma completes the proof of assertion (iii) of Theorem 
fm We shall take 

A:=KologP, V := /^l(PlogP)^ C := KiPlogP. (2.31) 

We show that for appropriately chosen /ti, K2 we have 

P(Bv,c) = 0(1). (2.32) 

From (12.301) and limAr_^oo P = C)0, for sufficiently small kq we will then be able to 
conclude that 

= 0{P^^^\og^P) a.s. asiV^oo. (2.33) 

As Kiy^] ~ 4P^ (see (12.241) ). assertion (iii) in Theorem 1 1 . 1 [ follows immediately. Thus 
we are reduced to proving (12.321) . which we now proceed to do. 

First we simplify things a little. For any n E In and A E ^l,we introduce the shorthand 

'^n,A := ^1ao,...,a„_i. (2.34) 

Let 

AniA) := E[^„,A|an = 0] - El'^nAWn = !]• (2.35) 
As E[^„,A|a„ = 0] > E[^n,A\an = 1], we see A„(A) > 0. For x E {0, 1}, 

Cn{x,A) = |E[^„,A|a„ = x]- E[^„,a]|. (2.36) 

Since 

E[^,,^] = pEI^^^aK = I] + {I - pM^uaK = 0] 

= E['Wn,A\an = l] + il~p)AniA), (2.37) 
Cnix, A) from (12.251) simplifies to 

C^{l,A) = (l-p)A„(A), Cn{0,A) = pA„(A). (2.38) 
Since p < 1 — p for all sufficiently large N, we have then 

C{A) = (1 -p) max A„(A). (2.39) 

0<n<N 

Further, (IZ271) . (IZISl) and (12381) yield 

N 

V{A)=p{l-p)Y,Al{A). (2.40) 

n=0 

This completes our simplifications. 

Recall that in order to use (12.301) from Lemma [Z2l we need to prove (|2.32l) (namely 
that P(Bv,c) = 0(1)). The heart of the proof of (12.321) is to show that for an appropriate 
choice of /ti, /t2, ^3 > 0, with probability 1 — o(l) all three of the following events 
occur: 

J2 ^niA) = 0(1), (2.41) 

|7V-n|>K3P2 logP 
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max AJA) < K2P\ogP, (2.42) 

0<17V-n|<K3P2 logP 

V{A) < Ki{P log P)^. (2.43) 

We claim that (|2.41|) - (|2.43l) imply (12.321) . This follows immediately from applying the 
trivial bound P(lv,c) < ^{V > V)+F{C > C) and using (|24T]) - (IZ431) to show these 
two probabilities are both o(l). 

To summarize, the proof is completed by verifying (I2.41I) - (I2.43I) . Notice also that 
(12.411) and (12.421) . together with (12.401) . imply (12.431) for any choice of ki > k^ks, so it 
just remains to prove the former two. As in the arguments that follow there is a symme- 
try between n and N — n,we consider n with < n < N/2; the remaining n follow 
similarly. 

First, consider (12.411 ). Note that, depending on the parameter P, this sum could be 
empty. This will not affect the argument to follow. The proof is by an averaging argu- 
ment, i.e.: for each n we first consider Eao,...,a„_i[An(A)]. This quantity has a very nat- 
ural interpretation : in words, it is the expected increase in the size of the sumset A + A 
brought about by the addition of the number n to A. For every z G {n, ...,n + N}, 
adding nto A will add z to A + Aif and only if z — n E A and, for any other numbers 
ni, n2 such that rii + n2 = z, either rii ^ A or n2 ^ A. Let S'^ be the event that z gets 
added to A + Ahy the addition of n to A. Then using (12.231 ) we can explicitly estimate 

n+N n+N 

E.„...,.„_JA„(A)] = 5^PK] ~ p^(l-/)-''^iLfJ.L2i^J}. (2.44) 

z=n z=n 

Since n < N/2, the last sum is asympotically no more than 

L7V/2J 

2p J2 = (2 + o(l))P(l-p2)«/2. (2.45) 

r=Ln/2j 

By Markov's inequality, we deduce that for any n, 

P(A„(A) > 2P(1 - p2)"/4) < (1 + o(l))(l - p2)-/4_ (2.46) 
Then, just using a trivial union bound 

P(V'^n) < I]P('^n), (2-47) 

it follows that, with probability at least 

LJV/2J 

1- (1 + 0(1))- Yl (2-48) 

n= \K:jP^ log P] 

we have 

A„(A) < 2P(l-p2)"/^ for all n such that /t3P2 logP < n < (2.49) 
Then (|2.41l) clearly follows provided 

L7V/2J 

p- (i-p'r^' = o(i), (2.50) 

n= [ksP^ log P] 
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which is clearly the case for sufficiently large k-^, since 1 = o(P). 

We now turn to (I2.42I ). Firstly, a similar argument to the one just given shows that, 
even if n < n^P^ log P, adding n to a random set A is very probably not going to add 
any elements at all to A + A which are larger than k^P"^ log P. Secondly, among the 
numbers in I^.^pz logp, the addition to A of one number cannot add to A + A more num- 



bers than were in A already, plus maybe one more. But Chernoff's inequality ( HASL 
Corollary A. 14) implies that, with probability 1 — g-ciKsPiogP^ where ci is some univer- 
sal positive constant, | A fl /k3P2 p | < 2K3P log P. Then (12.421 ) follows from a simple 
union bound, as long as K2 > 2^3 for example. 

This completes the proof of the assertion of Theorem II. If iii) as regards the sumset. 

For the difference set, we proceed in two identical steps. First consider the estimate 
of E,[^^]. Let S'n now denote instead the event that n ^ A — A for each n E ±In- 
Clearly, 

N 

E[^'] = 2- J]P(^„) + o(l). (2.51) 

n=l 



N-n 



For each n > we have 

= A (2.52) 

m=0 

where ^^m.n is the (bad) event that both m and m + nlie'm A and ^m,n is the comple- 
mentary event. These events are not independent, but the dependencies will not affect 
our estimates. To see this rigorously, one can for example use Janson's inequality (see 
HASH . Chapter 8, though this is certainly overkill!) 

M < P < Mexp (y^) , (2.53) 

where all P(^^,0 < e, M = Um^(^m,n) and 

A = ^ P(i^„,„ A ^m',n), (2.54) 

the sum being over dependent pairs {m, m'}, i.e.: pairs such that m' = m + n. 
Note that we can take e = p^, we have M = (1 — p2^Af-n+i 

^ ^ f(iV-2n + l)p3 ifn<iV/2 
[0 ifn > N/2, 

since there is a 1-1 correspondence between dependent pairs and 3-term arithmetic pro- 
gressions in In of common difference n. It is then easy to see that this correction term 
can be ignored when we make the estimate 

E[S>'] ~ 2-y(l-p2)A^-"+i ~ 4 ^ 7:^^% (2.56) 
^ — ^ 2 

n=l 

as desired. 
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The concentration of 3)" about its mean can be established in the same way as we 
did with ,5^" above. A little more care is required in estimating quantities analogous 
to IEao,...,a„_i [^n(^)]5 bccausc of the dependencies between different representations of 
the same difference, but Janson's inequality can again be used to see rigorously that this 
will not affect our estimates. We omit further details and simply note that we will again 
obtain the result that 

|^^-E[^^]| = 0{P'^l'^\oi P) a.s. asAT^oo. (2.57) 

This completes the proof of Theorem ll.il □ 

3. General Binary Linear Forms 

We have the following generalization of Theorem ll.il : 

Theorem 3.1. Let p : M — > (0, 1) he a function satisfying (11.31) . Let u, v be non-zero 
integers with u > \v\, GCD{u, v) = 1 and {u, v) ^ (1, 1). Put f{x, y) := ux + vy. For 
a positive integer N, let Abe a random subset of In obtained by choosing each n E In 
independently with probability p{N). Let 3f denote the random variable \f{A)\. Then 
the following three situations arise : 

(i) p{N) = o(A^-V2) . j^^^ 

3f ^ {N ■p{N)f. (3.1) 

(ii) p{N) = c ■ N~^^'^ for some c E (0, oo) .• Define the function gu,v '■ (0, c>o) 
(0,M+ \v\) by 

9uA^) ■■= + 1^1) - M ) - - l^l)e""- (3.2) 



Then 

S>f ~ gu,v {-]N. (3.3) 



u 

(Hi) ^ o{p{N)) : Let ^ := {u + \v\)N - Then 

2u|t;| 

3f ~ —. 

f p{Nf 



(3.4) 



Proof. One follows exactly the method of proof of Theorem 11.11 so we only give a 
sketch here. 



Casel :p(A^) = 0{N 



We again write p for p{N). For any finite set A C No and any integer k > 1, let 

^'k,f ■= ^2), . . . , (a2fc-i, a2fc)} : /(ai,a2) = ■ ■ ■ = /(a2fc-i, a2fe)} . (3.5) 

Let X'l^j := I Then (12.211 ) has the following generalization : 

mif] - ( + ^) p''N'^\ (3-6) 



'^'^J \{k + l)\ k\ 



andX^ r ~ E[X^ .] whenever A^"'^^^ = o{p) 
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We shall just sketch the proof that ]E[X(, j] behaves like the right-hand side of (13.61) in 
the case when f > 0. The proof for t> < is similar, and the concentration of about 

its mean when N \ '^k ) = o(jp) is established by the same kind of second moment 
argument as in Section 2. 

If f > then for any A C we have f{A) C I(u+v)n- Then 

nKj] - ^kAN)-p'\ (3.7) 

where 

{u+v)N . , . s 
n=0 ^ ^ 

and R{n) denotes the number of solutions to the equation ux + vy = n satisfying 
(x, y) G /at X For any integer n, the general integer solution to ux + vy = n is of 
course 

X = nxo — vt, y = ny^ + ut, t G Z, (3.9) 
where uxq + vyo = 1. If n > then there are [n/uv\ +0(1) solutions in non-negative 
integers, and for all such solutions, {x, y) G I\-n/u\ x Iin/v\ ■ For n G I(u+v)n the fol- 
lowing three situations then arise : 

(I) n G IvN '■ then all non-negative solutions satisfy (x, y) E In x In, so R{n) = 
[n/uv\ + 0(1) in this case. 

(II) vN < n < uN : we have R{n) = [N/u\ + 0(1) for any such n. 

(III) uN <n<{u + v)N : we have R{n) = [{u + v)N - n]\ + 0(1) for these n. 

Thus it follows that 

n=0 ^ ' n=vN ^ ^ n=uN ^ 

~2-™E('^)+("-")iv(TM 0.11) 

~ 2ut;^^ + (« - .)ivi|^ (3-12) 

which, together with (|3.7I) . verifies our claim that IE[X^ f] behaves like the right-hand 
side of (1X61). 

Once we have (13.61) then, in a similar manner to Section 2, we can prove part (i) of 
Theorem |3.1| by noting that X2 j = o(X[ j) almost surely when p = o{N^^/'^), and part 
(ii) by showing that 

00 

% ~ Y.{-lf-'X',j (3.13) 

k=l 

when p = ciV~^/^. It's a simple exercise to check that (13.131) and (13.61) yield (13.31) . 
Casell: _ o(p(Ar)). 



(3.10) 



WHEN ALMOST ALL SETS ARE DIFFERENCE DOMINATED 



15 



We give a sketch of the estimate for E[i^j], the details of the concentration estimate 
being completely analogous to what has gone before. Let us continue to assume v > 0, 
the proof for v < being similar. As in the proof of Theorem 1 1 . 1 T iii) one may check that 
various dependencies do not affect our estimates which, using observations (I),(n),(ni) 
above, lead to 

vN uN {u+v)N 

^[^/] ~ ^(1 + (1 -P')"^^" + (1 (3.14) 

n=0 n=vN n=uN 

lN/u\ 

~ 2 ■ ut; ^ (1 - p2)" + _ v)N{l - p2)^/«. (3.15) 

n=0 

The sum is ~ and the second term is negligible since 1 = o(Np'^), so E[^j] ~ 
2uv /p^, as claimed. 

This completes the proof of Theorem 13. II □ 

As mentioned earlier, the main result of nNOORS H was that, for any two binary forms 
/ and g, including the case when g{x,y) = x + y, there exist finite sets Ai,A2 of 
integers such that |/(y4i)| > |5'(Ai)| and |/(y42)| < |(7(v42)|. Theorem 13. II has anumber 
of consequences on the matter of comparing |/(v4)| and 1(7(^4)1 for given / and g and 
random subsets A of In for large N. We now reserve the notations f(x, y) := Uix + viy 
and g{x, y) := U2X + V2y for two forms being compared. Unless otherwise stated, we 
assume neither / nor g is the form x + y. A generic form ux + vy will be denoted 
h{x,y). 

It is convenient to formalize a piece of terminology which we used informally in the 
introduction : 

Definition 3.2. Let f,g be two binary linear forms as above. Let p : N (0, 1) 
satisfy (|1.3I) . Then we say that f dominates g for the parameter p if as N ^ 00, 
\f{A) I > \g{A) I almost surely when A is a random subset of In obtained by 
choosing each n E In independently with probability p{N). 

We now consider three different regimes (depending on how rapidly p{N) decays). 
In the arguments below we shall write p for p{N). The most interesting behavior will 
be isolated afterwards as Theorem 13. 3 1 

Regime 1 : N-'^/^ = o(p(A^)). 

Then part (iii) of Theorem 13.11 implies, in particular, that ~ (ui + \vi\)N and 
~ (^2 + \v2\)N. Hence / dominates g when -ui + |fi| > ^2 + |t'2|. In particular, 
this is the case if g{x,y) = x — y. If ui + \vi\ = M2 + |f2| then the theorem says that / 
dominates g if and only if Ui\vi\ < U2\v2\, which is the case if and only if ui > U2- 

Regime! : p = o{N-^/'^). 

Part (i) of Theorem 13.11 says that ~ S)g ~ (^Np)'^ for any / and g. For every 
/c > 1 we have 

= Qk^uAN'^V)- (3-16) 



16 



PETER HEGARTY AND STEVEN J. MILLER 



Thus 

XUi,h = 0{Np^Xl^) = o{Xl^) almost surely. (3.17) 
The second moment method gives standard deviations 



,) = efc,„,, (^E[X^] + a) = 0fc,„,,(max{iv'^'/,iV^+V"n)- (3.18) 

In particular we have a{X[j^) = Q {[Npf/"^) and ^ = e(iVV)- First of all, then, 
if (XV) = 0((Xp)^/^), i.e.: if p = 0{N'^^^), then the uncertainty in the size of the 
random set A itself swamps everything else, and our model is worthless. 

If N^^/^ = o{p) then, by (13.171) . it is in the first instance the X'2 /j-term which will be 
decisive. By (13.61 ) we have 



Hence / dominates g in this range of 5 if t>i) < a{u2, V2) where 

1 / |f I u — |f I \ ?>u — \v 



a^t;) := — I V + ) = ..J • (3.20) 



Since it is easy to see that a{uiVi) = a{u2, V2) if and only if ui = U2, f 1 = ±f 2, this 
allows us to compare any pair of forms in the range A^~^/^ = o{p) and p = o(A^~^/^), 
except a pair ux ± vy. But for such a pair, our methods are entirely worthless any- 
way, since all the estimates in this section depend only on \v\. Note in particular that 
a{u, v) < a{l, —1) for any (u, v) 7^ (1, —1) so that any other form dominates x — y. 

Regime 3 : p = cN^^^'^. 

By part (iii) of Theorem 13.11 for a given value of the parameter c G (0, 00), / dom- 
inates g if 

9uuvi f — ) > 9u2,v2 {—] ■ (3.21) 

Since gu,v{x) ^ m + l^;! as x 00, / will dominate g for sufficiently large values of c, 
provided Mi + |f i| > M2 + |^2|. This is as expected from Regime 1. On the other hand, 
the Taylor expansion of t,, as a function of c, around c = 0, reads 

gu,v{(^) = - a{u,v)c^ + Ou,v{c^)- (3.22) 

Thus / dominates g for sufficiently small values of c provided a{ui, vi) < a{u2, ^2). 
Again this is as expected, this time from Regime 2. Note that the injectivity of a allows 
us to even compare forms with the same value of u + namely : for a fixed value of 
u + \v\, a{u,v) is clearly a decreasing function of u. Hence if ui + \vi\ = U2 + |f2| 
then / dominates g for all values of c G (0, 00) if and only if ui > U2. Note that this is 
the same condition as in Regime 1 . More generally, we have that / dominates g for all 
values of c whenever a{ui, vi) < a{u2, V2) and Ui + |f i| < ^2 + \v2\- In particular this 
is the case for g{x, y) = x — y and any other /. 

The most interesting phenomenon arises when we compare two forms such that 

+ |fi| > M2 + |'W2| and a{ui,Vi) > a{u2,V2)- (3.23) 
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Then the combined observations of Regimes 1 , 2 and 3 imply that there 

exists some 
Cf^g > such that 



g dominates / whenever N '^^^ = o(p) and p = o{N ^/^) 
or p = cN'^/"^ for any < c < c/,g, 



(3.24) 



whereas 



/ dominates g whenever p = cN for any c > Cf^g or = o{p). (3.25) 

This observation may be considered a partial generalization of the main result of FNO ORSlI 
to random sets, partial in the sense that it only applies to pairs of forms satisfying (13.231) . 
Equations (13.241) and (|3.25l) say that we have a sharp threshold, below which g domi- 
nates / and above which / dominates g. We leave it to future work to determine what 
happens as one crosses this sharp threshold. 

We close this section by summarizing the most important observations above in a theo- 
rem. 

Theorem 3.3. Let /(x, y) = Uix + U2y and g{x, y) = U2X + g2y, where Ui >\vi\> 0, 
GCD{ui, f j) = 1 and {u2, V2) 7^ (mi, ±f 1). Let a : Z^q — > Q Z^e the function given by 
(|3.20l) . The following two situations can be distinguished : 

(i) Ui + \vi\ >U2 + \v2 \ and a{ui, vi) < a{u2, f2)- 

Then f dominates g for all p such that N~^/^ = o{p) and p = o(l). In particular, 
every other difference form dominates the form x ~ y in this range. 

(ii) Ui + \vi\ > U2 + 1^2! and a{ui, Vi) > a{u2, ^2)- 

Then there exists Cf^g G such that (13.241) and (13.251) hold. Specifically, cj^g is the 
unique positive root of the equation 



Here is a sample of issues which could be the subject of further investigations : 

1. One unresolved matter is the comparison of arbitrary difference forms in the range 
where N'^^^ = 0{p) and p = 0{N^^/^). Here the problem is that the binomial model 
itself does not prove of any use. This provides, more generally, motivation for looking 
at other models. Obviously one could look at the so-called uniform model on subsets 
(see lULRH ). but this seems a more awkward model to handle. Note that the property of 
one binary form dominating another is not monotone, or even convex. 




(3.26) 



where gu,v{x) '■ ^ 



+ 



(0, M + |f I) is given by (|3.2I) . 



4. Open Problems 
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2. Secondly, a very tantalizing problem is to investigate what happens while cross- 
ing a sharp threshold, whenever it arises under the conditions of Theorem |3.3f ii). 

3. Thirdly, one can ask if the various concentration estimates in Theorem 11.11 can be 
improved. When p = o{N~^/'^) we have only used an ordinary second moment argu- 
ment, and it is possible to provide explicit estimates. Explicitly, the following follows 
from Chebyshev's Theorem (see the appendix for a proof). 

Theorem 4.1. Let p{N) := cN'^ for some c > 0, 5 G (1/2, 1). Set C := max(l,c), 
f(S) := min{i, ^^^} and let g{5) be any function such that < g{5) < f{5) for all 
5 e (1/2,1). SetPi{N) := (4/c)iV-(i-^) and P2{N) := N'(f(^)-9m. For any subset 
chosen with respect to the binomial model with parameter p = p{N), with probability at 
least 1 — Pi ( A^) — P2 {N) the ratio of the cardinality of its difference set to the cardinality 
of its sumset is 2 + Oc{N^9^^^). Thus the probability a subset chosen with respect to 
the binomial model is not difference dominated is at most Pi{N) + P2{N), which tends 
to zero rapidly with N for 5 G (1/2, 1). 

The range N'^^"^ = o(p(N)) seems more interesting, however. Here we proved that 
the random variable has expectation of order P(Ny, where P{N) = l/p(N), and 
is concentrated within P(iV)^/^ log^ P{N) of its mean. Now one can ask whether the 
constant 3/2 can be improved, or at the very least can one get rid of the logarithm? 

4. Finally, it is natural to ask for extensions of our results to Z-linear forms in more 
than two variables. Let 



be such a form. We conjecture the following generalization of Theorem 3.1: 

Conjecture 4.2. Let p : N ^ (0,1) be a function satisfying (11.31) . For a positive integer 
N, let A be a random subset of 1^ obtained by choosing each n & 1^^ independently 
with probability p{N). Let f be as in (4.1) and assume that GCD{ui, m„) = 1. Set 



Let '3>j denote the random variable \ f{A)\. Then the following three situations arise : 

(i) p{N) = o{N-'^/'') : Then 

% ~ ^{N-p{N))\ (4.3) 

(ii) p{N) = c ■ N~^^^ for some c G (0, 00) .• There is a rational function R{xq, Xk) 
ink + 1 variables, which is increasing in Xq, and an increasing function 

gui,...,uk '■ (0, 00) (0, Yl^^i such that 



f{xi,...,Xk) = UiXi-\ \-UkXk, UiEZ^o, 



(4.1) 



'■= G Sfc : (?V(i), ...,M^(fc)) = (Mi, ...,Mfc)}. 



(4.2) 



(P(c,Mi, ...,Uk)) ■ N. 



(4.4) 




29 J Ul 



(4.5) 
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Appendix A. Explicit bounds 

Here we prove Theorem |4.1[ The proof uses essentially only Chebyshev's inequality. 
We have deliberately made this section entirely self-contained, rather than appealing to 
results from [AS J, for the benefit of readers who may not be too familiar with discrete 
probability theory. 

We first establish some notation, and then prove a sequence of lemmas from which 
Theorem 14. 1 1 immediately follows. Our goal is to provide explicit bounds which decay 
like to a power. 

Let Xn-N' denote the binary indicator variable for n being in a subset (it is thus 1 
with probability cN^^ and otherwise), and let X be the random variable denoting the 
cardinality of a subset (thus X = J2n ^n;N)- For two pairs of ordered elements (m, n) 

and (m', n') in 1^ x I]\r (m < n, m! < n'), let Ym,n,m.',n' = lifn — m = n' — m', and 
otherwise. 



Lemma A.l. With probability at least 1 — Pi(N), 



X G 



-cN'-\ -cN'-' 
2 ' 2 



(A.l) 



Let O denotes the number of ordered pairs {771,11) (with m < 71) in a subset of In 
chosen with respect to the binomial model. Then with probability at least 1 — Pi{N) we 
have 

^cN^-^ ( iciV^-^ - 1) lcN^~^ ( ^cN^-^ - 1) 

2 V2 J_ < < 2 \2 J_ .^ 2) 

2 - - 2 ^ ^ 

Proof We have E[X] = IE[X„.Ar] = cN^^K As the X^-n are independent. 
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4„^^ = N {cN-' - c'N'^') . (A.3) 

n 

Thus 

ax < Vc-N^. (A.4) 

By Chebyshev's inequality, 

ProbflX - cN^-^\ < kax) > 1 - i^- (A.5) 

k^ 

ForX e [IcN^-^, I cX^""^] we choose A; so that 

kax = ^cN^'^ < k^cN^. (A.6) 

Thus k >\ y/cN^^'^^^"^ , and the probability that X lies in the stated interval is at least 
1 — (cN^'^ / A)^^ . The second claim follows from the fact that there are (2) ways to 
choose two distinct objects from r objects. □ 



20 



PETER HEGARTY AND STEVEN J. MILLER 



Remark A.2. By using the Central Limit Theorem instead of Chebyshev's inequality 
we may obtain a better estimate on the probability of X lying in the desired interval; 
however, as the Central Limit Theorem is not available for some of the later arguments, 
there is negligible gain in using it here. 

Proof of Theorem 4.1. By Lemma IaTTI (IA.2I) holds with probability at least 1 — 
Pi{N). The main contribution to the cardinalities of the sumset and the difference set 
is from ordered pairs (m,n) with m < n. With probability at least 1 — Pi{N) there 
are on the order N'^^'^^ such pairs, which is much larger than the order N^^^ pairs with 
m = n. The proof is completed by showing that almost all of the ordered pairs yield 
distinct sums (and differences). Explicitly, we shall show that for a subset chosen from 
In with respect to the binomial model, if O is the number of ordered pairs (which 
is of size N"^""^^ with high probability), then with high probability the cardinality of its 
difference set is 20 + Oc{N^'^^) while the cardinality of its sumset is O + Oc{N'^'^^)- 
This argument crucially uses 5 > 1/2 (if 5 = 1/2) then the error term is the same size 
as the main term, and the more delicate argument given in the main text is needed). 
We shall show that almost all of the ordered pairs generate distinct differences; the 
argument for the sums follows similarly. 

Each ordered pair (m, n) yields two differences {m — n and n — m). The problem 
is that two different ordered pairs could generate the same differences. To calculate 
the size of the difference set, we need to control how often two different pairs give the 
same differences. Consider two distinct ordered pairs (m, n) and (m', n') with m < n 
and m' < n' (as the A^^^*^ ^ JSf"^-"^^ 'diagonal' pairs (n, n) yield the same difference, 
namely 0, it suffices to study the case of ordered pairs with distinct elements). Without 
loss of generality we may assume m < m' . If n — m = n' — m' then these two pairs 
contribute the same differences. There are two possibilities: (1) all four indices are 
distinct; (2) n = m'. 

We calculate the expected number of pairs of non-diagonal ordered pairs with the 
same difference by using our binary indicator random variables Ym,n,m',n'- Set 

Y — ^ ^ ^ ^ ^ ^ ^t,n,m',n'- (A. 7) 

l<m<m'<N m'<n'<N m<n<N 

— m' —n — m 

If the four indices are distinct then E[Fm „ m' n'] = c^N'"^^; ifn = m' then E[Fm „ m' «/] = 

The number of tuples (m, n, m', n') of distinct integers satisfying our conditions is 
bounded by A^'^ (once m, n and m' are chosen there is at most one choice for n' E 
{m + 1, . . . , A^} with n' — m' = n — m£|. If instead n = m' then there are at most 
A^^ tuples satisfying our conditions (once m and n are chosen, m' and n' are uniquely 
determined, though they may not satisfy our conditions). Therefore 

E[Y] < ■ c^N-"^^ + N"^ ■ c^N-^^ < 2C^A^3-45 (^_g^ 

as6 e (1/2,1). 

As A^'^"'^'^ is much smaller than N"^"^^ for 5 > 1/2, most of the differences are 
distinct. To complete the proof, we need some control on the variance of F. In Lemma 



' Although we do not need the actual value, simple algebra yields the number of tuples is + 

0{N^). 
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IA.3l we show that 

ay < rC^N'^^l (A.9) 

where 

2r{5) = max{3 - 45, 5 - 75}. (A.IO) 

While we cannot use the Central Limit Theorem (as the Ym,n,m',n' are not independent 
and also depend on A^), we may use Chebyshev's inequality to bound the probability 
that Y is close to its mean (recall the mean is at most 2C^N^~'^^). We have 

Prob(|F-E[r]| < fcay) > 1 - (A.ll) 

Simple algebra shows that if we take k = A^2-2(5-r(<5)-c/(5) ^^^^ ^^^^y probability at least 
1 _ N~{f{S)-9m we have Y < ^^2-25-9(5) ^ ^^^^^ ^ positive power of N less than 
jY2-2<5 ^Ym^ at most negligible amount of the differences are repeated. 

The argument for two ordered pairs yielding the same sum proceeds similarly: if 
yu + z/ = /i' + z/' then v — fi' = u' — fi. 

For our ratio to be 2 + Oc{N~^^^^), two events must happen. As the probability 
the first does not occur is at most Pi ( A^) and the probability the second does not oc- 
cur is at most P2{N), the probability that the two desired events happen is at least 
l-Pi{N)-P2iN). 

Except for the claimed estimate on ay, the above completes the proof of Theorem 14. 1[ 
We now prove our bound for ay- 

Lemma A.3. Let the notation be as in Theorem \4.1\ and (A.IO). We have 

ay < 7C^A^"(^). (A. 12) 

Proof. If U and V are two random variables, then 

Var(t/ + V) = Var([/) + Var(V) + 2CoVar(f/, V). (A. 13) 

By the Cauchy-Schwartz inequality, CoVar([/, V) < A/Var(t/)Var(y). Thus 

Var(f/ + 1/) < 3Var(f/) +3Var(F). (A.14) 

We may therefore write 

^ ^ Yra,n,m' ,n' ^ ^ Um,n,m' ,n' ~\~ ^ ^ '^m,n,n' U -\- V, (A.15) 

where in the U -sum all four indices are distinct (with l<m<m'<N,m<n<N, 
m' < n' < N and n — m = n' — m') and in the \^-sum all three indices are distinct (with 
l<m<n<n'<N and and n — m = n' — n). As Var(F) < 3Var(f/) + 3Var(\^), 
we are reduced to bounding the variances of U and V . 
We first bound Var(t/). Standard algebra yields 



Var(f/) = Var f/m,„,m',n') 



^ Var(f/„ 

,n,m' ,n' 

) + 2 CoVar(f/™ 

{m,n,m' ,n')^{m,n,fh' ,n') 

(A.16) 
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As Var(?7m,n,m',n') = c^N '^^ — c^N and there are at most ordered tuples 
(m, n, m', n') of distinct integers with n — m = m' — n' , the Var(t/^^„_m/^„/) term 
is bounded by c^Ar3-45. 

For the covariance piece, if all eight indices (m, n, m' , n' , rh, n, rh! , n') are distinct, 
then Um,n,m',n' and Um.fi^m',n' are independent and thus the covariance is zero. There are 
four cases; in each case there are always at most N"^ choices for the tuple (m, n, m', n'), 
but often there will be significantly fewer choices for the tuple (m, n, m', n'). We only 
provide complete details for the first and third cases, as the other cases follow similarly. 

• Seven distinct indices: There are at most N"^ choices for {m,n,rn! ,n'). The 
covariance of each such term is bounded by c^N'"^^. To see this, note 

The product of the expected values is c^N~^^, while the expected value of the 
product is c' N~'^^ . Thus the covariances of these terms contribute at most 

• Six distinct indices: The covariances of these terms contribute at most c^N^^^^ . 

• Five distinct indices: The covariances of these terms contribute at most c'N^'^^ 
(once three of the m, n, m', n' have been determined, the fourth is uniquely 
determined; thus there are at most A^^ choices for the first tuple and at most 1 
choice for the second). 

• Four distinct indices: The covariances of these terms contribute at most c^N^"^^. 

The A^-dependence from the case of seven distinct indices is greater than the A^-dependence 
of the other cases (except for the case of four distinct indices if 5 > 2/3). We also only 
increase the contributions if we replace c with C = max(c, 1). We therefore find 

Var(C/) < C^A^3-45 ^ ^ {C'^N^-^^ + C^N^-^^ + C^N^-^^ + C^N^'^^) 

= SC^AT^-^'^ + eC^AT^-^^. (A.18) 
Similarly we have 

Var(y) = Var( J] Kn,n,n') 

(A. 19) 

The Var(ym,n,n') piece is bounded by A^^ • c^N"^^ (as there are at most A^^ tuples with 
n' — n — n — m). The covariance terms vanish if the six indices are distinct. A similar 
argument as before yields bounds of c^A^3-55 distinct indices, c^N"^^^^ for four 

distinct indices, and c^A^^~^^ for three distinct indices. The largest A^-dependence is 
from the c^A"^"^'' term (as 5 > 1/2). Arguing as before and replacing c with C yields 

Var{V) < C^N^-^^ + 2- 3C^N^-^^ < IC^N'^-^^. (A.20) 

As 5 < 1, 2 - 35 < 3 - 45. Therefore 

Var(F) < 3 ■ (3C^A='-^^ + eC^Ar^"^^) + 3 ■ IC^N"^-^^ 

< 30C*N^-*^ + 18C^N^-^^ < 49CSA^2r(5)^ (^_21) 
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which yields 

ay < rC^N'^^^l (A.22) 

□ 

Remark A.4. An extreme choice of g would be to choose g(5) = e, for some small 
positive constant e. Since f(5) > 1/4 for all 5 E (1/2, 1), we then obtain a bound of 
2 + Oc{N^'') for the ratio of the cardinality of the difference set to the sumset with 
probability 1 - Oc{N-'^'''^^-^^i-'^). 

Remark A.5. Alternatively, one can get a tighter bound on the ratio than in Theorem 
14.11 at the expense of having a bound on the probability which is non-trivial only for 
S < Sraax < 1- For cxamplc, if one instead chooses k = A^3-4<5-r((5) (A. 11) and 
P2{N) := jv-(6-85-2r(5))^ ^Yien the statement of Theorem gl] still holds, but is a non- 
trivial statement only for 6 — 86 — 2r(S) > 0, i.e.: for S < 3/4. This is a natural choice 
of (5max5 given the results of HGJLRL Specifically, if 5 > 3/4 then almost surely no 
differences or sums are repeated, and the set is a Sidon set (and therefore we trivially 
have the ratio of the cardinality of the difference set to the sumset is approximately 2). 
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